Developer(s) | Community, and developers from CollabNet, Elego, VisualSVN, WANdisco |
---|---|
Initial release | October 20, 2000 |
Development status | Active |
Written in | C |
Operating system | Cross-platform |
Type | Revision control |
License | Apache License |
Website | http://subversion.apache.org/ |
In software development, Apache Subversion (formerly called Subversion;[1] command name svn, Swedish pronunciation: [svɛn]) is a revision control system founded and sponsored in 2000 by CollabNet Inc. Developers use Subversion to maintain current and historical versions of files such as source code, web pages, and documentation. Its goal is to be a mostly-compatible successor to the widely used Concurrent Versions System (CVS).
The open source community has used Subversion widely: for example in projects such as Apache Software Foundation, Free Pascal, FreeBSD, GCC, Django, Ruby, Mono, SourceForge, ExtJS, Tigris.org, PHP, Python and MediaWiki. Google Code also provides Subversion hosting for their open source projects. BountySource systems use it exclusively. CodePlex offers access to Subversion as well as to other types of clients.
The corporate world has also started to adopt Subversion. A 2007 report by Forrester Research recognized Subversion as the sole leader in the Standalone Software Configuration Management (SCM) category and as a strong performer in the Software Configuration and Change Management (SCCM) category.[2]
Subversion uses the Apache License, making it open source.
The Subversion project was founded by CollabNet in 2000 as an effort to write an open-source version-control system which operated much like CVS but which fixed the bugs and supplied some features missing in CVS. By 2001, Subversion had advanced sufficiently to host its own source code.[3] In November 2009 Subversion was accepted into Apache Incubator: this marked the beginning of the process to become a standard top-level Apache project.[4] It became a top-level Apache project on February 17, 2010.[5]
Subversion offers two types of repository storage: FSFS and Berkeley DB.
FSFS works faster on directories with a large number of files and takes less disk space, due to less logging.[6] Beginning with Subversion 1.2, FSFS is the default data store for new repositories.
Berkeley DB repositories have some limitations: if a program that accesses the database crashes or is terminated forcibly, the result can be repository corruption and data loss. According to Version Control with Subversion, the only safe way to use a Berkeley DB repository is on a dedicated server, accessed by a single server process running as one user.[7] Existing tools for Berkeley DB repository recovery are not completely reliable, so system administrators need to make frequent repository backups.
Access to Subversion repositories can take place by the following means:
All three means can access both FSFS and Berkeley DB repositories.
Any 1.x version of a client can work with any 1.x server. Newer clients and servers have additional features and performance capabilities, but have fallback support for older clients/servers.[9]
Internally, a Subversion system comprises several libraries arranged as layers. Each performs a specific task and allows developers to create their own tools at the desired level of complexity and specificity.
One can view the Subversion filesystem as "two-dimensional"[10]. Two coordinates are used to unambiguously address filesystem items:
Each revision in a Subversion filesystem has its own root, which is used to access contents at that revision. Files are stored as links to the most recent change; thus a Subversion repository is quite compact. The system consumes storage space proportional to the number of changes made, not to the number of revisions.
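The "two-dimensional" addressing described above can be illustrated with a minimal sketch. It assumes the two coordinates are a path and a revision number, and it models each revision root as a plain dictionary; the class and method names (`ToyRepository`, `commit`, `read`) are hypothetical and do not correspond to the Subversion libraries.

```python
class ToyRepository:
    """Toy model: every revision has its own root, stored as a dict of path -> content."""

    def __init__(self):
        # Revision 0 is an empty tree.
        self._roots = [{}]

    def commit(self, changes):
        """Create a new revision from the latest one; `changes` maps path -> new content."""
        new_root = dict(self._roots[-1])  # copies references only, not the file contents
        new_root.update(changes)
        self._roots.append(new_root)
        return len(self._roots) - 1

    def read(self, path, revision):
        """Address an item by its two coordinates: path and revision."""
        return self._roots[revision][path]


repo = ToyRepository()
r1 = repo.commit({"trunk/README": "first draft"})
r2 = repo.commit({"trunk/main.c": "int main(void) { return 0; }"})
r3 = repo.commit({"trunk/README": "second draft"})

# The same path gives different content depending on the revision coordinate.
old_readme = repo.read("trunk/README", r1)
new_readme = repo.read("trunk/README", r3)
```

In this toy model an unchanged file in a later revision is the same object as in the earlier one, which loosely mirrors the point that storage grows with the number of changes rather than with the number of revisions.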
The Subversion filesystem uses transactions to keep changes atomic. A transaction operates on a specified revision of the filesystem, not necessarily the latest. The transaction has its own root, on which changes are made. It is then either committed, becoming the latest revision, or aborted. The transaction is actually a long-lived filesystem object: a client does not have to commit or abort a transaction it has begun; it can instead begin a transaction, exit, and later re-open the transaction and continue using it. Multiple clients can access the same transaction and work together on an atomic change, though no existing clients expose this capability.
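A rough sketch of the transaction idea follows. It is a toy model, not the Subversion filesystem API: `ToyFilesystem` and `ToyTransaction` are hypothetical names, revisions are plain dictionaries, and the sketch deliberately omits the out-of-date checks a real system would need when a transaction is based on an older revision.

```python
class ToyTransaction:
    """A private root on which changes accumulate until commit or abort."""

    def __init__(self, filesystem, base_revision):
        self.filesystem = filesystem
        self.base_revision = base_revision
        # The transaction root starts as a copy of the base revision's root.
        self.root = dict(filesystem.revisions[base_revision])
        self.open = True

    def write(self, path, content):
        if not self.open:
            raise RuntimeError("transaction already finished")
        self.root[path] = content

    def commit(self):
        """Publish all accumulated changes at once as the new latest revision."""
        if not self.open:
            raise RuntimeError("transaction already finished")
        self.filesystem.revisions.append(self.root)
        self.open = False
        return len(self.filesystem.revisions) - 1

    def abort(self):
        """Discard the transaction; no revision is created."""
        self.open = False


class ToyFilesystem:
    def __init__(self):
        self.revisions = [{}]  # revision 0 is an empty tree

    def youngest(self):
        return len(self.revisions) - 1

    def begin(self, base_revision=None):
        if base_revision is None:
            base_revision = self.youngest()
        return ToyTransaction(self, base_revision)


fs = ToyFilesystem()
txn = fs.begin()
txn.write("trunk/a.txt", "alpha")
txn.write("trunk/b.txt", "beta")
visible_before_commit = fs.youngest()  # nothing is visible until commit
new_revision = txn.commit()

abandoned = fs.begin()
abandoned.write("trunk/c.txt", "gamma")
abandoned.abort()
```

Both writes in the first transaction become visible together in a single new revision, while the aborted transaction leaves the revision list untouched.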
One important feature of the Subversion filesystem is properties: simple name=value pairs of text. Properties occur in two different places in the Subversion filesystem. The first is on filesystem entries (i.e., files and directories). These are versioned just like other changes to the filesystem. Users can add any property they wish, and the Subversion client uses a set of properties, which it prefixes with 'svn:'.
Subversion also uses properties on revisions themselves. As with the properties on filesystem entries, the names are completely arbitrary, and the Subversion client uses certain properties prefixed with 'svn:'. However, these properties are not versioned and can be changed later.
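The difference between versioned properties on filesystem entries and unversioned properties on revisions can be sketched as follows. This is only an illustration: `ToyPropertyStore` and its methods are hypothetical, and the property names `svn:eol-style` and `svn:log` are used simply as familiar examples of the 'svn:' prefix.

```python
class ToyPropertyStore:
    """Toy model of the two places where properties live."""

    def __init__(self):
        # One dict per revision: (path, property name) -> value. Versioned.
        self.node_props = [{}]
        # One dict per revision: property name -> value. Not versioned.
        self.rev_props = [{}]

    def commit_node_prop(self, path, name, value, log_message):
        """Changing a property on a file or directory creates a new revision."""
        props = dict(self.node_props[-1])
        props[(path, name)] = value
        self.node_props.append(props)
        self.rev_props.append({"svn:log": log_message})
        return len(self.node_props) - 1

    def set_rev_prop(self, revision, name, value):
        """Changing a revision property overwrites it in place; no history is kept."""
        self.rev_props[revision][name] = value


store = ToyPropertyStore()
r1 = store.commit_node_prop("trunk/a.txt", "svn:eol-style", "native", "add eol-style")
r2 = store.commit_node_prop("trunk/a.txt", "svn:eol-style", "LF", "switch to LF")

# The old value of the versioned property is still reachable at revision r1.
old_value = store.node_props[r1][("trunk/a.txt", "svn:eol-style")]

# The revision property is simply replaced; the previous log message is gone.
store.set_rev_prop(r1, "svn:log", "add eol-style property")
```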
Subversion uses the inter-file branching model from Perforce[13] to handle branches and tags. A branch is a separate line of development.[14] Tagging involves creating a snapshot of the repository's content, which, unlike a branch, is not expected to change in the future.
The system sets up a new branch or tag by using the 'svn copy' command, which should be used in place of the native operating system mechanism. Subversion does not create an entire new file version in the repository with its copy. Instead, the old and new versions are linked together internally and the history is preserved for both. The copied versions take up only a little extra room in the repository because Subversion saves only the differences from the original versions.
All the versions in each branch maintain the history of the file up to the point of the copy, plus any changes made since. One can "merge" changes back into the trunk or between branches. To Subversion, the only difference between tags and branches is that changes should not be checked into the tagged versions. Due to the differencing algorithm, creating a tag or a branch takes very little additional space in the repository.
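Why a branch or tag costs so little space can be shown with a small sketch of a copy that links to the existing tree instead of duplicating it. The representation (nested dictionaries) and the helper names `copy_path` and `write_file` are assumptions made for illustration; they are not how Subversion stores data internally.

```python
def copy_path(tree, src, dst):
    """Return a new tree in which `dst` refers to the same subtree object as `src`."""
    new_tree = dict(tree)      # shallow copy of the top level only
    new_tree[dst] = tree[src]  # a link to the existing subtree, not a deep copy
    return new_tree


def write_file(tree, top, name, content):
    """Return a new tree with one file changed, copying only the directory that changes."""
    new_tree = dict(tree)
    new_dir = dict(tree[top])
    new_dir[name] = content
    new_tree[top] = new_dir
    return new_tree


rev1 = {"trunk": {"main.c": "int main(void) { return 0; }", "README": "hello"}}

# Creating the branch adds one reference; no file content is duplicated.
rev2 = copy_path(rev1, "trunk", "branches/feature")

# Editing the branch leaves the trunk, and the branch's unchanged files, shared.
rev3 = write_file(rev2, "branches/feature", "README", "feature notes")
```

A tag would be created with exactly the same `copy_path` call; in this model, as in the text above, the only difference is the convention that nobody commits changes to it afterwards.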
Release 1.6.* of Subversion only allows directory access control and lacks more granular file access control. That problem dramatically restricts the use of Subversion in projects where directories are not structured to address the functional separation among various objects. For example, directories like lib, src, bin do not address security and access control in most cases.
A known problem in Subversion affects the implementation of the file and directory rename operation. As of 2010, Subversion implements the renaming of files and directories as a "copy" to the new name followed by a "delete" of the old name. Only the names change; all data relating to the edit history remains the same, and Subversion still uses the old name in older revisions of the "tree". However, Subversion may become confused when files are modified and moved in the same commit. This can also cause problems when a move conflicts with edits made elsewhere,[15] for example when merging branches.[16] The Subversion 1.5 release addressed some of these scenarios, while others remain problematic.[17]
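The "copy followed by delete" behaviour can be modelled in a few lines. The sketch below is a simplified illustration with hypothetical names (`rename`, and a flat dictionary standing in for the tree); it only shows that the operation is recorded as two separate actions rather than as a single move.

```python
def rename(tree, old, new):
    """Model a rename as a copy to the new name followed by a delete of the old name."""
    actions = []
    new_tree = dict(tree)

    new_tree[new] = tree[old]            # step 1: copy, keeping a link to the old content
    actions.append(("copy", old, new))

    del new_tree[old]                    # step 2: delete the old name
    actions.append(("delete", old))

    return new_tree, actions


before = {"trunk/util.c": "/* helpers */"}
after, recorded = rename(before, "trunk/util.c", "trunk/helpers.c")
```

Because nothing in the recorded actions says "this was a move", a tool that later merges an edit made to the old name has to infer the connection, which is one way to picture the conflicts described above.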
As of 2010, Subversion lacks some repository-administration and -management features. For instance, someone may wish to edit the repository to permanently remove all historical records of certain data. Subversion does not have built-in support to achieve this simply.[18]
Subversion stores additional copies of data on the local machine, which can become an issue with very large projects or files, or if developers work on multiple branches simultaneously. These .svn directories on the client side can become corrupted by ill-advised user activity.[19] The "svn cleanup" command helps when an unfinished operation leaves an internal lock behind.
Subversion does not store the modification times of files. As such, a file checked out of a Subversion repository will have the 'current' date (instead of the modification time in the repository), and a file checked into the repository will have the date of the check-in (instead of the modification time of the file being checked in). This might not always be what is wanted.[20]
Subversion does not use a distributed revision control model. Ben Collins-Sussman, one of the designers of Subversion, believes a centralised model would help prevent "insecure programmers" from hiding their work from other team members.[21]
While Subversion stores filenames as Unicode, it does not specify if precomposition or decomposition is used for certain accented characters (such as é). Thus, files added in SVN clients running on some operating systems (such as OS X) use decomposition encoding, while clients running on other operating systems (such as Linux) use precomposition encoding, with the consequence that those accented characters do not display correctly if the local SVN client is not using the same encoding as the client used to add the files.[22][23]
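The precomposed and decomposed forms mentioned above can be compared directly with Python's standard `unicodedata` module. The file name and the helper `same_filename` are made up for illustration; the sketch shows only the encoding difference itself, not how any SVN client handles it.

```python
import unicodedata

name = "caf\u00e9.txt"  # "é" as the single precomposed code point U+00E9

precomposed = unicodedata.normalize("NFC", name)
decomposed = unicodedata.normalize("NFD", name)  # "e" followed by combining acute U+0301


def same_filename(a, b):
    """Compare two names after bringing both to the same normalization form."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# The two strings look identical when rendered but differ code point by code point.
differ_as_stored = precomposed != decomposed
match_after_normalizing = same_filename(precomposed, decomposed)
```

A comparison that does not normalize first treats the two spellings as different names, which is the kind of mismatch between clients that the paragraph above describes.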
By design, the svn log command is always recursive: trying to access the history of a directory systematically pulls out the history of its entire hierarchy. A workaround is to use, instead of the command line, a sophisticated SVN client with filtering capabilities.
This subsection focuses on tags but parts of it also apply to branches.
Revision numbers are difficult to remember in any version-control system. For this reason most systems offer symbolic tags as user-friendly references to them. Subversion does not have such a feature, and what its documentation recommends using instead is very different in nature. Instead of implementing tags as references to points in history, Subversion recommends taking snapshots and storing them in a well-known subdirectory ("tags/") in the space of the repository tree.
This history-to-space projection causes at least two problems:
1. Once a snapshot has been taken, the system does not remember which revision it came from. This is the difference between a copy and a reference. This makes some operations less convenient and others impossible. For instance, a naive svn diff -r tag1:tag2 myfile does not work; achieving the same result is slightly more complicated than that. Other operations, such as svn log -r tag1:tag2 myfile, are simply impossible.
2. When two (ideally independent) object types live in the repository tree, a "fight to the top" can ensue. In other words, it is often difficult to decide at which level to create the "tags/" subdirectory:

    trunk/componentfoo/
    trunk/componentbar/
    tags/1.1/componentfoo/
    tags/1.1/componentbar/

or

    componentfoo/trunk/
    componentfoo/tags/1.1/
    componentbar/trunk/
    componentbar/tags/1.1/
To address such problems, posters on the Subversion mailing lists have suggested a new feature called "labels".[24] SVN labels would more closely resemble the "tags" of other systems such as CVS or git. As of 2010, this suggestion has not progressed.
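The distinction between a tag as a copy and a label as a reference can be made concrete with a purely hypothetical sketch. Nothing below exists in Subversion: `set_label` and `resolve_range` are invented names, the revision numbers are made up, and the only point is that a name mapped to a revision number can be resolved back to that revision.

```python
labels = {}  # hypothetical label store: label name -> revision number


def set_label(name, revision):
    """Record a symbolic name for a point in history, not a snapshot in the tree."""
    labels[name] = revision


def resolve_range(spec):
    """Turn a 'label1:label2' range into a pair of revision numbers."""
    start, end = spec.split(":")
    return labels[start], labels[end]


set_label("tag1", 120)
set_label("tag2", 157)
revision_range = resolve_range("tag1:tag2")
```

With such a mapping, a range like tag1:tag2 resolves to two revision numbers that could be handed to a command expecting a revision range, which is what the snapshot-in-a-subdirectory approach cannot offer.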
CollabNet has continued its involvement with Subversion, but the project runs as an independent open source community. In November 2009 the project was accepted into the Apache Incubator, aiming to become part of the Apache Software Foundation's efforts.[25] Since March 2010, the project is formally known as Apache Subversion, being a part of the Apache Top-Level Projects.[26]
In October 2009 WANdisco announced the hiring of core Subversion committers as the company moved to become a major corporate sponsor of the project. This included Hyrum Wright, president of the Subversion Corporation and release manager for the Subversion project since early 2008, who joined the company to lead its open source team.[27]
The Subversion open-source community does not provide binaries, but potential users can download them from volunteers, from CollabNet (the initiator of the Subversion project), or from WANdisco. While the Subversion project does not include an official graphical user interface (GUI), third parties have developed a number of different GUIs, along with a wide variety of additional ancillary software.
The Subversion committers normally have at least one or two new features under active development at any one time. As of January 2010 the major goals for the next version of Subversion included a streamlined HTTP transport to improve performance and a rewritten working-copy library.[28]
Other work in progress includes SubversionJ (a Java API) and implementation of the Obliterate command, similar to that provided by Perforce. Both of these enhancements are being sponsored by WANdisco.[29]